Glossary extraction and utilization in the information search and delivery system for IBM Technical Support

نویسندگان

  • Lev Kozakov
  • Youngja Park
  • Tong-Haing Fin
  • Youssef Drissi
  • Yurdaer N. Doganata
  • Thomas Cofino
چکیده

In this paper we describe the practical aspects of extracting and using a glossary for a selected technical domain. We first describe the existing glossary extraction process, as applied to general corpora, and examine its shortcomings in the technical support domain. Then we propose a number of enhancements to it, including focusing the glossary on a selected domain context, providing support for multidomain glossaries, and importing domain-specific dictionaries. We apply our focused-glossary approach to the IBM Technical Support corpus and incorporate resulting glossaries within the information search and delivery system used by IBM Technical Support. We demonstrate the effectiveness of our approach by evaluating the quality of keywords and terms extracted from sample documents with the help of these glossaries.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Qualitative Assessment of the Evidence Utilization for Health Policy-Making on the Basis of SUPPORT Tools in a Developing Country

Background SUPPORT tools consist of 18 articles addressing the health policy-makers so that they can learn how to make evidence-informed health policies. These tools have been particularly recommended for developing countries. The present study tries to explain the process of evidence utilization for developing policy documents in the Iranian Ministry of Health and Medical Education (MoHME) and...

متن کامل

Making Decision Support System for Utilization of Biogas in Iran

The use of renewable energy sources is often suggested to be a good solution for climate change and the dependency to fossil fuel. Biogas utilization is a one of these promising options that can mitigate these problems since biogas is produced by the fermentation of waste, so is rich in methane and has the same characteristics as natural gas. Biogas has increasingly been noticed in different co...

متن کامل

Information Architecture of Research Institutes’ Website, Case Study: Iranian Research Institute for Information Science and Technology’s Website

Purpose: As mission-oriented organizations, research institutes have the task of answering community questions in specialized areas, and should therefore be able to effectively present their outputs to their target users. Achieving such a goal requires the proper use of information architecture principles to properly organize the information platform in which the research institutes interact wi...

متن کامل

Designing educators’ information literacy model at the agricultural technical schools in Mazandaran province, Iran.

Information literacy embraces the ability to access useful information, to have the awareness organizing knowledge and information requiring different methods of search and most effective diagnostic information for problem solving and decision-making. Those who lack these abilities are continuously confused in the vast ocean of information. This study investigated designing model of educators’ ...

متن کامل

Review of ranked-based and unranked-based metrics for determining the effectiveness of search engines

Purpose: Traditionally, there have many metrics for evaluating the search engine, nevertheless various researchers’ proposed new metrics in recent years. Aware of this new metrics is essential to conduct research on evaluation of the search engine field. So, the purpose of this study was to provide an analysis of important and new metrics for evaluating the search engines. Methodology: This is ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • IBM Systems Journal

دوره 43  شماره 

صفحات  -

تاریخ انتشار 2004